Sub-Heisenberg estimation strategies are ineffective 
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In interferometry, sub-Heisenberg strategies claim to achieve a phase estimation error smaller than 
the inverse of the mean number of photons employed (Heisenberg bound). Here we show that one 
can achieve a comparable precision without performing any measurement, just using the large prior 
information that sub-Heisenberg strategies require. For uniform prior (i.e. no prior information), 
we prove that these strategies cannot achieve more than a fixed gain of about 1.73 over Heisenberg- 
limited interferometry. Analogous results hold for arbitrary single-mode prior distributions. These 
results extend also beyond interferometry: the effective error in estimating any parameter is lower 
bounded by a quantity proportional to the inverse expectation value (above a ground state) of the 
generator of translations of the parameter. 

PACS numbers: 03.65.Ta,03.67.Ac,07.60.Ly,42.50.Gy,06.20.-f,42.50.Lc 
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It has been known for a long time that, under very 
general conditions ]l|-[T(|, the precision A</> in determin- 
ing the phase in interferometry is bounded by the inverse 
of the mean total number of photons N used in the es- 
timation as A0 1/N: the Heisenberg bound to optical 
interferometry. Recent work [l2| has challenged the 
validity of such bound in regimes not covered by prior 
proofs, raising the question of whether it is possible to 
implement sub-Heisenberg strategies where the error has 
a scaling smaller than 1/N. A negative answer was given 
in [|l3| for those estimation strategies that are capable 
of reducing significantly the initial uncertainty on the 
phase. This result was obtained in the theoretical frame- 
work of quantum estimation theory |l4| , |j"5) , invoking the 
quantum speed limit theorem [|l6| - see also Ref. [ p"7| . 
Here we show that the same negative answer applies to es- 
sentially all estimation strategies: if sub-Heisenberg scal- 
ings could be attained, they would be of limited use as 
the overall gain in accuracy they provide is limited to a 
(small) constant factor over the prior information avail- 
able. To prove this, we exploit the quantum Ziv-Zakai 
(ZZ) lower bound to precision introduced by Tsang [[I8| , 
Differently from the quantum Cramer-Rao bound fl4|, |l5|| 
it is not tight in general and cannot be used to study the 
optimal performance of an estimation procedure, but it 
provides a bound on the achievable precision that explic- 
itly depends on the probability distribution characteriz- 
ing the prior information of the problem. The inequali- 
ties we obtain here are an extension of the ones of [[l3| , 
since here the prior information on the parameter to be 
estimated is included. 

We note that there have been prior debates on sub- 
Heisenberg scaling [fl9| glj, and proposals to use non- 
linear schemes [^2[ |23|j : a detailed review is found in [^5| . 

We start reviewing the ZZ bound |l8| showing that, in 
the regime of large information gain (Low Prior Informa- 
tion (LPI) regime) , it reproduces the results of Ref. [O . 



Namely, in this regime we prove that no sub-Heisenberg 
scaling can be attained, independently of the probability 
distribution that characterizes the prior information on 
the parameter. In the opposite regime (High Prior Infor- 
mation (HPI) regime) instead, we show that, even though 
sub-Heisenberg scalings are not prohibited by the ZZ 
bound, the resulting accuracy is of the same order of the 
one obtainable by guessing a random value distributed 
according to the prior distribution (without performing 
any measurement). This last result is derived under spe- 
cific, but reasonable, assumptions on the prior distribu- 
tion. In particular, wc show that for uniform prior the 
ZZ bound forces any sub-Heisenberg scheme to accura- 
cies which are only a meager 1.73 times larger than the 
one obtainable via a random guess. 

The final part of the paper contains material which is 
not directly related with the Heisenberg bound issue, but 
which makes use of the same technique developed in the 
previous sections. Specifically, generalizing previous re- 
sults of [ fl8| , we show how the quantum ZZ bound can be 
used to derive a weighted uncertainty relation weighted 
by generic prior distributions. 

Lower bound on the weighted precision: — Let x be the 
parameter to be measured, say the relative optical phase 
acquired by an optical pulse when traveling through the 
two arms of an interferometer. Wc allow x to take any 
possible real value and assign a prior probability distri- 
bution p(x) characterized by the quantity W which mea- 
sures the initial uncertainty of the problem - e.g. for p 
Gaussian, W can be identified with the standard devi- 
ation j^4|. As is customary in quantum metrology [ jl5| , 
we assume x to be encoded into a state p x of a quantum 
probe system (say the optical pulse emerging from the 
Mach-Zehnder interferometer) which we measure obtain- 
ing the random outcome y. We can quantify the accu- 
racy of the estimation through the weighted Root Mean 
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Square Error (RMSE) pi: 
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where X(y) is the estimation of x we construct from the 
outcome y, and where p(y\x) is the conditional probabil- 
ity distribution of getting a certain y given x. A lower 
bound for AY follows from the quantum ZZ bound p8|, 
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l-y/l-F(p x ,p x+z j\ }' 2 , (2) 



where F is the fidelity between the states p x and p x +z 
that correspond to a true value of the parameter of x 
and x + z respectively (an alternative, slightly stronger 
inequality is presented in the Appendix). The parameter 
x is mapped onto the probe state p x by a unitary 
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with H the generator of translations of x (an effective 
Hamiltonian). Then, a bound for the fidelity F is Jl3|, 



F( Px , Px+z ) > a- 1 (2nz/Tr) , 



(4) 



where T~L := Tr[JJ/9o] — Ho is the expectation value of the 
generator H above the ground level "Ho of H, and the 
function a^ 1 is the inverse of the function a(e) defined in 
Rcf . jn| which vanishes for e 1 and is approximated by 
4arccos 2 ( v / e)/7r 2 for e £ [0, 1]. In intcrfcrometry, H is the 
number operator of the optical mode p5[ and % is with 
the mean photon number N employed in the experiment. 
Replacing (f§) into (§) we find p5| 



Ay ^ Ay. R , where 
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AY^ := <j ^ / <ftt B(a:o<) 1 - y/l-a~^t) 
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with := n/(2H) and 



£(z) 



cfa; min[p(a;),p(x + z)\ (7) 



being a function which satisfies the constraint ^ E ^ 1 , 
with E(0) = 1 and E{z) ~ for z > W 

Asymptotic regimes: — The inequality (p ) is an exten- 
sion of the bound given in Ref. |D| , as it also contains the 
prior information in the function E. There is no guaran- 
tee that the accuracy Ay can reach Ay B : indeed most 
likely the achievable optimal threshold for Ay is larger, 
see also Still the inequality (||) is useful to analyze 
whether sub-Heisenberg scalings are possible or not. In 
particular, when the integral in (^) can be approximated 
by a constant, the factor Xq implies a 1/H scaling for Ay 
as requested by the Hcisenbcrg bound. This happens for 



instance in the LPI regime considered in [[i~3) where the 
Heisenberg scaling 1/% guarantees a large improvement 
over the prior uncertainty measured by W , i.e. when the 
ratio t := W/xq = 2HW/ir diverges. In this case in fact, 
independently from the specific form of the prior distri- 
bution p(x), one has E(x t) = E(Wt/to)\ to ^ > . 00 -> 1 for 
all values of the integral in (^|) and hence 

Ay L <r> := lim Ay B = x M = £- , (8) 
to-s-oo V 2 V 2 2H 



with A 



dt t[l - yji - a- 1 ^)} ~ 0.042 . (9) 



A similar result was derived in under the assump- 
tion of a uniform prior distribution p(x) , and by bound- 
ing a -1 of Eq. (Q) with a linear function. Since the 
asymptotic value Ay L ( B PI) does not depend on the spe- 
cific choice of the prior and has a Heisenberg scaling, we 
will use it as a benchmark to evaluate the performances 
of sub-Heisenberg strategies [^6| . 

Consider now the HPI regime in which the prior un- 
certainty is already much better than the one we could 
possibly get from a Hciscnbcrg-like scaling. Indicatively 
this can be identified by the condition to <C 1. Since for 
all t ^ 0, E{x t) = E(Wt/t ) vanishes when t o f^fl 
one expects that Ay B of Eq. (||) reduces drastically. As a 
consequence, the bound (0) no longer prevents Ay from 
dropping below the 1/T-L scaling defined by the bench- 
mark value of Eq. (§1). However, this is hardly surpris- 
ing: in the HPI regime the real question is whether or 
not one can get a significant improvement with respect 
to the initial uncertainty. Drawing general conclusions 
on this issue is difficult due to the complex dependence 
of (^) from the prior probability p(x). Interestingly, in 
the case of single-mode distributions p(x) (i.e. with a sin- 
gle maximum) we can show that the improvement on Ay 
vanishes for to — > 0. To prove this we note that for single- 
mode distributions Eq. (R) yields 



pVm+Xot/2 

E(xot) = 1—1 dx p{x) , 

Jy m —x t/2 
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with y m := j/o + Xot/2, where yo is the point where 
p(yo) = p(yo + xot) (such point is unique for single-mode 
distributions). Remembering that the probability distri- 
bution p{x) has a width W, from ([To| ) it is clear that for 
to — > the function E(xot) = E(Wt/t ) is non-null only 
when t ~ 0. Hence, to gauge the lower bound (|^) in the 
HPI regime we can remove the term in square brackets in 
(||), since a _1 (t) ~ 1 for t ~ 0. Accordingly, (^) becomes 

A 2 y B = f / dtt E(x t) = r + f / dt t 2 [p(y m + f) 
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FIG. 1: Comparison between the bounds on the precision of 
the estimation for uniform prior. The continuous line is the 
lower bound AY LB defined in ( |12|). The dashed line is the ac- 
curacy AX = Wj*JV2 = x t /V12 one gets by guessing the 
value of x from the prior distribution. The dotted line is the 
asymptotic limit AY L ( B PI) of (||) which provides the bench- 
mark for the Heisenberg scaling. For to > 1 the bound (^) 
implies that the relative gain Ay L g FI ' /AY one can achieve 
with sub-Heisenberg strategies is limited by a factor smaller 
than 2. For small to the gap increases, but in this case, one 
has to compare the performance of sub-Heisenberg strategies 
with the uncertainty AX of the prior which also reduces: the 
vertical segment indicates the maximum gain ~ 1.73. Here 
xo = 1. 

where the first equality follows from an integration by 
parts, whence T :~ XqE(xq)/A, and the second equality 
is a change of integration variables. If the prior p(x) has 
a variance A 2 X |27|], for to — >• the above expression is 



AV(HPI) 
OI LB 



lim AYl B = \/A 2 X + (y m - > AX(ll) 



where [i is the distribution mean, and where we used the 
fact that lim t0 _j. oo T = [|4|. The asymptotic inequality 
( pr| ) shows that the estimation error is lower bounded 
by the standard deviation AX of the prior distribution: 
the same precision of a strategy that just takes a random 
guess (according to the prior distribution) for the param- 
eter to estimate. Therefore, sub-Heisenberg schemes are 
useless in this regime: one can attain the same precision 
just by "guessing" the parameter. 

Intermediate regimes: — Having discussed the general 
behavior of the bound (||) for large and small to, here we 
analyze some examples of prior distributions for which 
the function AYl B can be explicitly evaluated. This al- 
lows us to study regimes which are intermediate between 
HPI and LPI, showing that sub-Heisenberg strategies 
can deliver advantages which, at most, are limited to a 
(small) constant factor with respect to the prior accuracy. 

Probably the most interesting case is when one has no 
prior information about the true value of the parameter, 
which corresponds to a uniform prior p(x) of width W 
with equal weight for all possible values |lq] . In this 
case, we have E(z) = 1 — z/W for z < W, and E(z) = 
otherwise. Thus, Eq. (o) becomes 



AY LB 
A(t ) 

B(t ) 



xo y/[A(t )-B(t )/t ]/2 , with 

min(to)l) 

dtt[l - \/l - a- 1 ^)} , 

min(io,l) 



(12) 
(13) 




dt t 2 [l - ^l-a- 1 ^)] , (14) 



FIG. 2: Alternative priors. As in Fig. [j], the continuous line 
represents the lower bound (|^) , the dashed line represents the 
prior uncertainty provided by the standard deviation AX of 
the prior, the dotted line is the asymptotic Heisenberg scal- 
ing (|8|), and xo — 1. (a) Gaussian with W = AX: the bound 
(|) here takes the form A 2 Y ^ f£ dt t[l - erf(-^-)][l - 

y/l — a -1 (£)]. The alternative bound ( |i7| ) is plotted as a 
dash-dotted line, (b) The bimodal distribution defined in the 
inset, with W = xoto and AX = xotoy /r TS/4S (dashed line) 
[here the asymptotic to — > behavior is AY = xoto i/7/48 
(dash-dotted line)]. 

which is plotted in Fig. [I]. In agreement with the anal- 
ysis of the previous section, for large t — > oo Eq. ( p^ ) 
yields the universal limit (|J) — indeed A(oo) = A and 
B(oo)/to — > 0. Moreover, the trivial estimation strat- 
egy of choosing a random value according to the prior 
distribution gives an error AX = W/Vl2 = xoto/V^- 
In the HPI regime (to <C 1) the term in square brackets 
in ( |l3"| ) and ([ll]) can again be removed since a~ x (t) ~ 1 
for t ~ 0. Hence, the lower bound (jl2|) gives AY LB = 
x \/[tl/2 - *o/3]/2 = W/VT2 = AX, matching the triv- 
ial random estimation procedure in agreement with ([ill ) ■ 
Furthermore, the largest gap between the lower bound 
( |i~2|) and the prior uncertainty is reached for to — .5 (ver- 
tical line in Fig. [l]): even assuming that a sub-Heisenberg 
strategy could reach (|l2]), this will only provide a relative 
gain of about 1.73 with respect to the initial uncertainty. 

Similar results have been obtained for other prior dis- 
tributions. In particular, in Fig. |^ we report the results 
obtained for Gaussian and for a bimodal step-like prior 
distribution. In all cases we record maximum gains of 
order 2 over the prior uncertainties. 

Conclusions: — Using the quantum ZZ bound recently 
introduced in |lj| we have shown that, independently of 
the prior information available at the beginning of the 
protocol, the resulting accuracy cannot beat the Heisen- 
berg scaling 1/T-L in the LPI regime (which is arguably 
the most relevant for practical implementations). More- 
over, we have shown that in the other cases where the 
prior is sufficiently large to allow for a sub-Heisenberg 
scaling, the resulting accuracy cannot attain a signifi- 
cant enhancement over the one available before starting 
the estimation procedure. 

A similar analysis can be done by replacing % with the 
standard deviation AH of the generator H . This yields 
a generalized uncertainty relation [|8| weighted by the 
prior: It is sufficient to replace the inequality (ftf) with 
the Bhattacharyya-like inequality p9| of Ref. p6|, i.e. 
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F(p x ,p x+Z ) ^ cos 2 (AHz) for AHz ^ ir/2. Replacing 
this into (0), we obtain 



AY>{ 5 1 
I 2 



dtt£?(5 t)[l-sin(t7r/2)]j , (15) 



where Jo := n / (2 AH). From this it follows again that in 
the asymptotic regime to — > oo one has AY ^ 5o\J A' /2 
where A 1 = 1/2 — A/tt 2 ~ 0.095 is a constant that does 
not depend on the prior. In contrast, for to — > 0, and 
assuming a single-mode prior, the bound (|l5|) becomes 
Ay AX, showing that the initial uncertainty again 
determines the scaling of the accuracy. 

Appendix: — An inequality slightly stronger than (0^ 
can be obtained by using the quantum ZZ-bound |l8[ 



Ay > 



+ 3C 



\ \ I dz z I dx \p{x) + p(x + 2)] 

l 

xPr e (x,x + z)| 2 , (16) 



where Pr e {x,x + z) is the minimum error probability 
of a hypothesis-testing problem which aims to discrimi- 
nates between the states Rq := p x and R\ := p x +z as- 
suming that they are produced with probabilities Pq = 
p(x)/\p(x)+p(x + z)] and Pi = 1 — Pq, respectively. Intro- 
ducing two purifications "to and of i?o and R\ respec- 
tively, which verify the identity F(po,pi) = F(^o,^i), 
the quantity Pr e {x,x + z) can be bounded as follows 

Pr e (x,x + z) > i [1 — H^b-Ro — 

> ||P *o--Pi*ilk] 



1- v / l-4P PiP(*o,*i) 



l-Vl-4W(fio,fii) 



where the first inequality derives from fl4|| (see also 
Rcf. [|is[ ) and the second from the fact that the trace 
distance is decreasing under the action of partial trace. 
Replacing this into (Eq) and using (0) we then find 



Ay > 



dtt 



+ 00 



1- a/1-4 



dx [p(x) + p(x + x t)] (17) 



p(x)p(x + Xgt) 

[p(x)+p(x+x t)} 2 



By direct evaluation it results that at least for values of xq 
and for the prior distributions analyzed in the text, ( |l7j ) 
provides a stronger bound than (||), see Fig. [|. However, 
in the LPI regime ([l7]) and (JsJ) give the same asymptotic 
constraint. Indeed, since p(x + xot) = p(x + Wt/to), it 
follows that for to — > oo, p(x + xot) — > p{x) and thus the 
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